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Abstract 

This paper proposes the Potluck Problem as a model for the behav- 
ior of independent producers and consumers under standard economic 
assumptions, as a problem of resource allocation in a multi-agent sys- 
tem in which there is no explicit communication among the agents. 
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1 Introduction 

In the study of bounded rationality and inductive reasoning, Brian Arthur 
introduced the Santa Fe Bar problem (SFBP) [1]. The SFBP deals with the 
allocation of constrained resource to non-cooperating multiple agents. SFBP 
extensions have been studied in resource allocation games by different au- 
thors. For instance, Schaerf et al. [6] studied multi-agent learning in context 
of adaptive load balancing, while Galstyan et al. [3j studied resource alloca- 
tion games with changing resource capacities. In the model considered by 
Galstyan et al., there is communication among the agents before choosing 
their strategy. But here in the model we are proposing, there is no explicit 
communication among the agents. In this paper we generalize the SFBP 
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by introducing multiple producers and also by considering varying demands 
for the resources. We propose the Potluck Problem (PP) to model resource 
allocation and utilization in a multiple-producer, multiple-consumer envi- 
ronment. The model depicted by the PP is applicable in many real- world 
situations, e.g., an electrical power grid with many individual power sup- 
ply units and consumers of power. An economy in which there are multiple 
agents who predict the global behavior and take local decisions can be mod- 
eled. In a standard multi-player economic environment, price allocation in 
the presence of varying demands for the resource is a situation which re- 
sembles PP. Many distributed systems such as water management systems, 
Internet service providers where service on demand is required, etc., can be 
modeled by the PP. 

This problem is modeled by observing multiple instances of potluck dinner 
by a set of people. A potluck is a gathering of people where each person 
is expected to bring a dish of food to be shared among the group. The 
multiple persons (agents) will decide individually what quantity of food to 
contribute to the dinner, without any prior coordination among themselves. 
Such instances of dinner are repeated. The problem in deciding how much 
to contribute to the dinner is because of the varying demand for the food in 
every instance of the dinner. (For simplicity, we consider all food as consisting 
of just one dish.) 

Section [2] describes the Potluck Problem in more detail and also gives an 
explanation to how it is considered as generalizing the SFBP. In Section [3l 
it is shown that with rational learning, equilibrium state cannot be achieved 
in the PP. Then a weighted majority [3] learning algorithm is applied to 
achieve near-equilibrium behavior. Section H] discusses results of simulation 
of the PP. In the last section, possible extensions of this work are discussed. 

2 Problem Description 

Consider a scenario where N (e.g., 100) people (or agents) have a potluck 
dinner every week in a city. Each of them decides individually how much 
food to contribute to the dinner. The agents do not communicate among 
themselves except that every agent knows about the total demand and supply 
at the dinners in previous weeks. The dinner is enjoyable if there is no 
starvation or excess of food. Starvation means that the amount of food 



2 



brought to the dinner is not sufficient to serve the people in the dinner, while 
excess means that some food goes waste for being more than sufficient for 
the dinner. The demand for food by each agent varies according to variable 
individual appetite each week. An agent decides on how much to contribute 
depending on the prediction it makes for the demand and supply in that 
week. 

Specifically, as described we have a supply-side problem, but a corresponding 
demand-side problem can also be formulated where the available supply of a 
resource varies and demand is to be adjusted accordingly. The demand-side 
problem has applications in electrical power technologies, where it is called 
demand response [7\, and elsewhere. The model and results are very similar, 
however, so we do not describe this in detail. 

In a game-theoretic fashion, the Potluck Problem can be described as a re- 
peated, non-cooperative game. Say there are agents who are players in 
the game. Consider one instance of the game (one "dinner"), t. For a player 
i < N, the strategy set is < < MaXi, where Qi is the quantity of food 
carried to a dinner by agent i, and MaXi corresponds to going to dinner 
with the maximum supply of food that that agent i can take. Let M, denote 
the set of probability distributions over Qi which defines the mixed strategy 
for agent i. Now say Si^t G Mj indicate the mixed strategy of the player i 
at instance t of dinner, then the total source of food to the dinner will be 
St = Z^iLi ^i,t- The agent decides on the mixed strategy Si^t by predicting the 
total demand for the dinner which is denoted by Pi^f 

In every dinner all/some of the agents also act as consumers by consuming 
the food brought to the dinner. The agents' demand for the food also varies 
over different dinners. The demand for food by an agent i, at an instance t 
is given by di^f So the total demand for food in the dinner is Dt = J27=i di,t- 
Ideally, the dinner is enjoyable if St = Dt. This state of the game, where the 
supply and demand exactly match, is the equilibrium state in the Potluck 
Problem. If St < Dt, then there is starvation at the dinner, and if St > Dt 
there is an excess. The Potluck Problem is a repeated game of such instances. 

Remark 2.1. The Potluck Problem is a generalization of the SFBP. 

Consider of the Potluck Problem in which the demand for the resource 

is fixed over all iterations of the dinner, i.e., Vt, Dt = d such that d < N. Also 
assume the strategy set for all agents is constrained to take discrete values. 
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i.e., Qi = {0,1}, i < N. Now at an instance t of the dinner, if St < Df, 
then all players will increase their supply by choosing Si^t = 1- This will 
result in making the total supply to the dinner more than the demand, i.e., 
Ya=i ^i,t = which will cause an excess {St > d) . Then all players will 
choose a strategy Si,t = for the next dinner, causing starvation. This kind 
of oscillatory behavior is similar to that in the Santa Fe Bar Problem. 

To observe the similarity with the SFBP, consider d = 60 for all instances 
of the game. For each player the strategy set Qi = {0,1} corresponds to 
choosing not to attend the bar and choosing to attend the bar respectively. 
An under-crowded bar is one in which there is a resource (place in the bar) 
which is going waste. This is similar to excess in the PP. An overcrowded 
bar likewise resembles starvation in the PP. Thus the Potluck Problem is a 
generalization of the SFBP. 

3 Impossibility of Rational Learning 

The oscillatory behavior that is known to arise the PP shows the need to 
have some predictive mechanism by which agents can foresee the demand for 
the coming dinner and then decide upon on the supply they want to bring 
to achieve the equilibrium in the PP. 

3.1 Best-Reply Dynamics and e-Predictive Learning 

This section formalizes the argument pertaining to oscillatory behavior ob- 
served in the PP. First we note the standard best-reply dynamics and then 
explain the same in the context of the PP. 

The best reply dynamics is often termed as the Cournot adjustment model or 
Cournot learning after Augustin Cournot who first proposed it in the context 
of a duopoly model [2]. The best reply dynamics can be seen as a game in 
which each self-interested agent assumes that every other similar agent uses 
the same strategy in later periods that is similar to one most recently used. In 
similar vein, a learning algorithm used by an agent is predictive if it correctly 
matches up with the situations created by the agents. 

In the present context, we can say the following. 
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Definition 3.1. (i) An agent i is said to be employ the best-reply dy- 
namics in the PP, iff for all t , the player assumes that Pj t = -Dj-i and 
decides on the supply Si^t the player wants to contribute. 

(ii) A learning algorithm is said to be e-predictive in the PP, iff it generates 
a sequence of beliefs Pi^t for a player i, such that — Dt\ < e, for 
some e > 0. 

If we say agent i is able to predict using a learning algorithm, then the dif- 
ference between its prediction about the next week's demand and the actual 
demand for that particular week should be zero, or at least less than some e. 
An agent i is said to be rational, iff it plays only best-replies to its beliefs ^ . 
When playing rational, the agent assumes that the demand for the coming 
week is going to be same as the previous week, i.e., Pi^t = Dt-i- 

Remark 3.2. In the PP, if all the agents are rational and e-predictive, then 
Vt, \Dt — Dt+i\ < e, i.e., the variation in total demand between successive 
dinners should at most be e. 

However, given that the variation in total demand need not be bounded by e, 
it thus follows that rational agents will not be e-predictive. As a consequence, 
we have the following. 

Remark 3.3. In the PP, it is impossible to achieve equilibrium if agents 
employ rational learning. 

This in turn illustrates the need for agents to learn using better principles 
than best-reply dynamics. Such learning is called non-rational learning. 

3.2 An Algorithm for Non-Rational Learning 

As discussed in the previous section, a perfect rational approach may lead to 
oscillations in systems which resemble the behavior predicted in the Potluck 
Problem. So the agents in the game need a learning which is not perfectly 
rational. 

Definition 3.4. A predictor makes use of the previous dinner's data avail- 
able and makes a prediction for the demand of the dinner for the coming 
week. It is a function which uses St-i,St-2 ,---St~x (past x week's data) and 
Dt-i,Dt-.2,---Dt-x and predicts the demand for the coming week. 
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Various predictors that can be considered by an agent in the PP include: 

• Average demand over the last j (e.g., 10) instances of dinner. 

• Randomly choose the demand of one of weeks from the last j (e.g., 10) 
instances. 

• The rational predictor (presume that consumption the next time will 
be the same as the last time). 

• An oracle which gives consumption. 

• A time-varying function. 

The initial choice of predictors depends on the kind of system we are trying to 
model. In some, a time-varying function of demand should be incorporated 
as a predictor, e.g., in forecasting the demand for electrical power (which is 
typically higher during daytime than at night). In others, an oracle which 
forecasts demand (based on data from outside the system not known to 
agents) may be appropriate. 

Out of the various predictors available, each agent randomly chooses k pre- 
dictors. Then each agent has k predictions about the demand at the coming 
dinner, each of which is denoted by representing the prediction made 

by agent i's p^^ predictor for dinner t. The agent i decides the supply si^t to 
be taken to the dinner t based on the forecasts of those k predictors, using 
a weighted majority approach [5]. Each agent i maintains a weight Wi^p^t for 
each predictor p at time t, and updates it after each iteration of the game, 
with the weight of accurate predictors increasing and that of inaccurate ones 
decreasing. The initial weights of all predictors may be equal or some random 
non-zero values. 

The iterative update and learning algorithm used by the players can be sum- 
marized in the following steps. 

For an agent z, during dinner t, do the following. 

• Predict demand using all the predictors, i.e., find Ojpt,p = 1,2, ... ,k. 

• Predict the demand Pi^t by using all the predictions, using a weighted 
majority algorithm. 
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• Decide on the supply Si^t to be brought to the dinner. 

• Update weights of all the predictors based on the actual demand and 
supply at dinner t. 

The prediction for a particular instance of dinner is calculated by taking the 
weighted majority of the predictions made by the all predictors of the 
agent, i.e., 

After every instance of dinner, an agent updates the weights of all its pre- 
dictors based on how close they were to predicting the actual demand. 

The update equation of the weight of a predictor p at an instance t is 
= '^i,p,t X where F = (3", with being a parameter chosen 
such that < /3 < 1. If %^ > 1, then v is set to and if %^ < 1, then 



After updating all the weights, they are normalized to between and 1 using 
the equation 
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Figure 1: Resource excess/starvation occurring in the PP 
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3.3 Simulation Results 

The Potluck Problem has been simulated with 100 non-identical agents with 
production capacities in the range of 500 to 1000 discrete units, and con- 
sumptions in the range of to 1000, with 1000 instances of dinner. The 
mean consumption of all agents together came to 48,200. The mean pro- 
duction under rational learning was 47580, and under weighted majority 
learning was 48284. The weighted majority approach with the five simple 
predictors listed previously outperformed the rational approach about 99.5% 
of the time, and resulted in a level of starvation/excess that was 22.6% better 
than in the rational approach on average, and about 41.5% in the best case. 
(More intricate predictors are seen to yield even better results.) The results 
are depicted graphically in Figure [H 

4 Conclusion and Future Work 

This paper proposed and analyzed the nature of the Potluck Problem. It is 
observed that a weighted majority learning approach results in better parity 
between demand and supply, compared to rational learning. Though in the 
present work, we have modeled the problem as only one resource being allo- 
cated/consumed, the problem can be easily extended by considering multiple 
resources, which are needed in specific, possibly unforeseen, proportions for 
better utility. In the current analysis, each agent chooses predictors from 
a pool of predictors available. This problem can be studied also by each 
agent choosing a different set of predictors from others, which closely resem- 
bles social behavior, and different sets of predictors can be compared and 
evaluated. Various predictor behaviors and performance can also be studied 
for specific patterns of demand, which can help in applying this model in a 
realistic scenarios. 

References 

[1] Arthur, W. B. Inductive reasoning and bounded rationality. American 
Economic Review 84, 2 (1994), 406-411. This and other some related work 
may be seen at |http : //www . santaf e . edu/ arthur/Pa pers , the author's 
web page. 



8 



[2] CoURNOT, A. A. Recherches sur les Principes Mathematiques de la 
Theorie des Richesses. Calmann-Levy, Paris, 1974. Originally published 
in 1838, this was re-printed (H. Guitton, ed.) as part of a collection titled 
Les Fondateurs. 

[3] Galstyan, a., Kolar, S., and Lerman, K. Resource allocation 
games with changing resource capacities. In Proceedings of the Inter- 
national Conference on Autonomous Agents and Multi-Agent Systems 
(AAMAS-2003) (2003). 

[4] Greenwald, a. R., Mishra, B., and Parikh, R. The Santa Fe 
Bar Problem revisited: Theoretical and practical implications, Apr. 1998. 
Unpublished Manuscript. 

[5] LiTTLESTONE, N., AND Warmuth, M. K. The weighted majority 
algorithm. Information and Computation 108, 2 (Feb. 1994), 212-261. 

[6] ScHAERF, A., Shoham, Y., AND Tennenholtz, M. Adaptive load 
balancing: A study in multi-agent learning. Journal of Artificial Intelli- 
gence Research 2 (1995), 475-500. 

[7] Spees, K., and Lave, L. Demand response and elec- 
tricity market efficiency. Carnegie Mellon Electricity Industry 
Center Working Paper CEIC-07-01 (Jan. 2007). Available at 
http : //www . emu . edu/ electricity 



9 



